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Abstract —We propose an information-theoretic framework 
for phase retrieval. Specifically, we consider the problem of 
recovering an unknown vector x e I" up to an overall sign factor 
from m = [Rn] phaseless measurements with compression rate 
R and derive a general achievability bound for R. Surprisingly, 
it turns out that this bound on the compression rate is the same 
as the one for almost lossless analog compression obtained by 
Wu and Verdu (2010): Phaseless linear measurements are “as 
good” as linear measurements with full phase information in the 
sense that ignoring the sign of m measurements only leaves us 
with an ambiguity with respect to an overall sign factor of x. 

I. Introduction 

In many different areas of science, physical limitations 
make it impossible to measure the sign (phase in the complex 
case) of a signal but obtaining amplitudes is relatively easy. 
Well known examples are X-ray crystallography, astronomy, 
or diffraction imaging The problem of retrieving a 

signal up to a global sign (phase in the complex case) from 
intensity measurements is often referred to as phase retrieval. 
More formally, let R” be the set of equivalence classes 
[x] = {x}U{—x} with x £ K™. Phase retrieval is the problem 
of recovering [x] £ R” from m phaseless measurements of the 
forrrQ y = |Ax| £ R™ with measurement matrix A £ R mxra . 

It is by no means clear how large to has to be to allow for 
recovery of [x] £ R" from m phaseless measurements. Thus 
from the very beginning, there have been a number of works 
regarding recovery conditions for this problem in the context 
of specific applications |4). More recently, this question has 
been studied in more abstract terms, asking for the smallest 
number m of phaseless measurements that is required to make 
the mapping [x] i—► |Ax| injective without imposing structural 
assumptions on A. In Q, the authors showed that at least 
2n — 1 such measurements are necessary and generically 
sufficient to guarantee injectivity. Furthermore, it was shown 
that semidefinite programming can be used to recover [x] if A 
is random with i.i.d. Gaussian entries or with i.i.d. rows that 
are uniformly distributed on a sphere, as long as m > cqu for a 
sufficiently large constant cq 0. Other phase retrieval methods 
for which theoretical performance guarantees are available can 
be found, e.g., in O-Qo). 

Recently, there has been also interest in sparse phase 
retrieval , where the number s of nonzero coefficients of the 
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1 For a vector u E , we define the element-wise absolute value operation 

as |u| = (|ui|,..., |«fc|) T 


vector x is much smaller than n. This a-priori knowledge 
about x can be used to reduce the number of measurements 
significantly. For instance, 0(s log(n/s)) measurements were 
shown to be sufficient for stable sparse phase retrieval HD- 
If the rows of the measurement matrix A are a generic choice 
of vectors in R n , injectivity of the mapping [x] i—► |Ax| is 
guaranteed provided that to > 2s m. 

Contributions: Following the approach introduced for com¬ 
pressed sensing m and signal separation M problems, we 
formulate phase retrieval as an analog source coding problem. 
Assuming that the unknown vector x is random with a certain 
distribution, we derive asymptotic recovery results for [x]. 
Our results hold for Febesgue almost all (a.a.) measurement 
matrices A. However, our results are in terms of probability 
of error (with respect to the distribution of x) and hence 
do not provide worst-case guarantees. Specifically, we study 
the asymptotic setting n —> oo where the vector x is a 
realization of a random process; for each n, we let m = [ Rn] 
for a parameter R , which we denote compression rate. In 
Theorem |T]\ve show that we can recover [x] from m phaseless 
measurements with arbitrarily small probability of error for 
a.a. measurement matrices A, provided that n is sufficiently 
large and the compression rate R is larger than the (lower) 
Minkowski dimension compression rate (see Definition |4| of 
x. It is remarkable that the obtained result is identical to 
the corresponding result in compressive sensing Q3) where 
y = Ax, so that we can conclude that in terms of achievability 
results, phaseless linear measurements are “as good” as 
linear measurements with full phase information: Ignoring the 
sign of to measurements only leaves us with an ambiguity with 
respect to an overall sign factor of x. 

Notation: Roman letters A, B,... and a, b,... designate de¬ 
terministic matrices and vectors, respectively. Boldface letters 
A, B,... and a, b,... denote random matrices and random 
vectors, respectively. For the distribution of a random matrix 
A and a random vector a, we write p a and /i a , respectively. 
The ith component of the vector u (random vector u) is Ui 
(Ui). The superscript T stands for transposition. For a matrix 
A, tr(A) denotes its trace. The identity matrix of suitable size 
is denoted by I. For a vector u, we write ||u|| = v/u T u for its 
Euclidean norm. For the Euclidean space (R fe , || • ||), we denote 
the open ball of radius r centered at u £ R fc by £>fc(u, r), 
V{k,r) stands for its volume. The Borel sigma algebra on R 
is denoted by We write R> for the set of nonnegative 
real numbers with Borel sigma algebra . For u, v £ R fc , 


u ~ v means that either u = v or u = —v and we write for 
the corresponding equivalence classes [u] = {u} U {—u}. For 
a set S C R fc , = {[u] | u £ 5}. The indicator function on 
a set IA is denoted by xu- 

II. Main Results 

We start by formulating phase retrieval as a source coding 
problem. 

Definition 1. (Source vector) Let be a stochastic 

process on (K N ,^g N ). Then, for n £ N, the source vector 
x of length n is given by x = (xi,...,x„) T £ R". 


The sets U in the definitions for a n (e) and a n (e) are assumed 
to be nonempty and bounded. 

Example 1. The source vector x from DefinitionQjhas a mixed 
discrete-continuous distribution if for each n £ N the random 
variables x ? , i £ {1..... n}, are independent and distributed 
according to 

p Xi = (1- X)p d + Xp c , i £ {l r ..,n} 

where 0 < A < 1 is the mixing parameter, p c is a distribution 
on (R, Br), absolutely continuous with respect to Lebesgue 
measure, and //,/ is a discrete distribution. Then, lfl3l Th. 15] 


Definition 2. (Code, achievable rate) For x as in Definition 
\J\and e > 0, an ( n,m) code consists of 

(i) measurements |A • | : R™ — > R>; 

(ii) a decoder g : R> — > 1" that is measurable with respect 

to and 3^ n . 

We call R with 0 < R < 1 an e-achievable rate if there 
exists an N(e) £ N and a sequence of (n, [Rn\) codes with 
decoders g such that 

P[ff(lAx|) / x] < £ 

for all n > N(e). 

Next, we introduce the Minkowski dimension compression 
rate for source vectors. 


Definition 3. (Minkowski dimension) Let IA be a nonempty 
bounded set in R". The lower Minkowski dimension of IA is 
defined as 

dim B («) = liminf iogNu ^ 

log X 

and the upper Minkowski dimension ofU is defined as 

dW B (W) = limsup l ° SNu 1 (P) 
p—>0 log j 

where Nu(p) is the covering number oflA given by 
N u (p) = min jfc £ N | IA C (J p), u* £ R"|. 

ie{i,..,fc} 


If dim P (IA) = dime ((7), we write dims (£7). 

Definition 4. (Minkowski dimension compression rate) For x 
from Definition [7] and £ > 0, we define the lower Minkowski 
dimension compression rate as 

7? B ( £ ) = limsup a n (e), where 

n—f oo 


dim p (IA) 


a„ (e) = inf | lnl|i - WcR", P[x£W] >1 — e|. 


and the upper Minkowski dimension compression rate as 


Rb (e) = lim sup a n (e), where 

n—f oo 


_ , \ . ( fdim B (W) 

a n (e) = inf <- 

l n 


U CR", P[x£W] > 1 -e}. 


77b( £ ) = = A, 0 < £ < 1. 

The following result states that every rate R > Rb( £ ) I s 
£- achievable for Lebesgue a.a. matrices A. 

Theorem 1. Let 0 < £ < 1 and x as in Definition [7] Then, 
for Lebesgue a.a. matrices A £ R mxrl with m = \Rn\, R is 
an e-achievable rate provided that R > 

Proof. Since R > R^s) and m = [Rn\, Defmition|4]implies 
that there exists a sequence of nonempty bounded sets lA n C 
R” and an N(e) £ N such that 

dim B (W) < m (1) 

P[x£W] > l-£ (2) 

for all U = U n with n > N(e). In the remainder of the proof 

we assume that n is sufficiently large for <|7} and (0 to hold. 
The claim now follows from Proposition |T| below. □ 

Proposition 1. Let e > 0, x £ R n a random vector, and 
IA C R ra a nonempty bounded set with P[x £ IA] > 1—£. Then, 
for Lebesgue a.a. matrices A £ R mxn , there exists a decoder 
g with P[g(|Ax|) fi* x] < £ provided that dim p (7f) < to. 

Proof. See Section [III] □ 

Remark 1. By HU Sec. 3.2, Properties (i)-(iii)], the lower 
Minkowski dimension of any bounded nonempty subset in R” 
containing only vectors with no more than s nonzero entries is 
at most s. Therefore, Proposition [T] implies that any s-sparse 
random vector x £ R" can be recovered with arbitrarily small 
probability of error (by increasing the size of the set IA in 
Proposition Q}, provided that m > s. This result holds for 
an arbitrary distribution of x and a.a. matrices A £ R mxrl . 
The best known recovery threshold for deterministic s-sparse 
vectors is to > 2s 03. 

Remark 2. It is worth noting that formally phase retrieval 
can be formulated as a matrix completion problem with 
measurements yf = tr(a,;aJxx T ) using rank-one measurement 
matrices A, = a,aj, i = l,.,.,m. However, compared to the 
rank-one measurement matrices used in the matrix completion 
problem 131, 133, the matrices a^aj are symmetric. This 
complicates the proof of Proposition|T|significantly and forces 
us to develop a novel concentration of measure result (Lemma 
0. On the other hand, in phase retrieval we are interested 
in recovering symmetric rank-one matrices xx T (which is 








equivalent to the recovery of [x]), whereas matrix completion 
deals with the recovery of arbitrary low-rank matrices. 

In the mixed discrete-continuous case we can strengthen the 
result of Theorem Q] through the following lemma. 


where © follows from the definition of the decoder. Fix an 
arbitrary r > 0. Suppose that we can show that 

P(x) = P[3u £ U with u/x, |Au| = |Ax|] =0, xgll 

(4) 


Lemma 1. Let 0 < e < 1 and x be distributed according to 
the mixed discrete-continuous distribution in Example [7] with 
mixing parameter A. Then, for Lebesgue a.a. matrices A £ 
M mxn with m = \_Rn\, R is e-achievable provided that R > 
A. Moreover, R > A is also a necessary condition for R being 
e-achievable. 


Proof. Achievability: Follows from Theorem Q] and Example 
Q] Converse: Suppose that a rate R < A is e-achievable for 
some e with 0 < e < 1. This implies that there exists a set 
KCR" and a matrix A £ R mxn with m = \Rn\ such that 

(a) Pr[x £ K] > 1 — e; 

(b) | A • | is one-to-one on 1C„ 

for n sufficiently large. From ( b ) it follows that there can be at 
most one equivalence class [u] £ 1C^ with Au = A(—u) = 0 
(if there was more than one such equivalence class then the 
mapping |A • | would not be one-to-one on /C^). 

Suppose first that there is no equivalence class [u] = 
{u, —u} £ 1C^ with Au = A(—u) = 0 and u f 0. Then, 
(b) implies that A is one-to-one on 1C which, together with (a) 
and R < A, leads to a contradiction to the converse part of 
H2 Thm. 6], 

Now suppose that there is an equivalence class [u] = 
{u, —u} £ with Au = A(—u) = 0 and u ^ 0. Let R 
be such that R < R < A and set m = [Rn\. Then, m > m 
for n sufficiently large. Let A = (A T , u, 0,..., 0) T £ R mxn . 
Then, (b) implies that A is one-to-one on 1C which, together 
with (a) and R < A, leads to a contradiction to the converse 
part of fl3l Thm. 6], □ 

III. Proof of Proposition Q] 

Let 


-P(y) = {u £ R"|u £ U , |Au| = y} 

U {u £ R"| — u £ U , |Au| = y}, y £ R>. 

For a vector u £ J-( y) \ {0}, let u denote the first nonzero 
component of u. We then define the reduced set 


P(y) = {u £ F(y) \ {0}|m = |u|} U (P(y) n {0}), y £ R>. 
We define the decoder g : R> —> R” by 


g( y) = 


u, if R(y) = {u} 


e, 


else 


where e is some fixed vector in the complement of U (used 
to declare a decoding error). Then, we have 


P[s(|Ax|) f x ] 

= P|s(|Ax|) /x,xgW] +P[s(|Ax|) ^x,x^W] 
< P[.9 (|Ax|) /x,xeW] +£ 

= P[3u £ U\u f x, |Au| = |Ax|, x £ U\ + e 


where A £ K m x has independent rows that are uniformly 
distributed on B n (0,r). Then, 

J P[3u £ U\u x, |Au| = |Ax|,x £ U] dp a 

A(r) 

= J P[3u £ U with u f x, |Au| = |Ax|] d^ x 
u 

= 0 (5) 

where we used Fubini’s Theorem and set A(r) = B n (0,r) x 
... x B n (0,r). Since r is arbitrary, © implies that 

P[3u £ U\n f x, |Au| = |Ax|,x £ U] = 0 (6) 

for Lebesgue a.a. matrices A. Hence, combining © and © 
proves the Proposition provided that we can show that © 
holds, which is done in Section [TV] 

IV. Proof of © 

Suppose first that x = 0. Then, P(x) =0 if and only if 

P [3u £li\ {0} with Au = 0] = 0. (7) 

Since dim P (id) < m, © follows from |jl4l Prop. 1], There¬ 
fore, we can assume in what follows that x ^ 0. 

We can upper-bound P(x) < Pi(x) + P 2 M with 

Pj(x) = P[3u £ Ui(x) with |Au| = |Ax|], i £ {1, 2} 
where we defined 

U\ (x) = {u £ U | rank(x, u) = 2} 

U-iix) = {u £ U | rank(x, u) = 1} \ {u £ U\u ~ x}. 

We have to show that P,(x) = 0 for i £ {1,2}. First, we 
establish P 2 (x) = 0. We have (recall that x/0) 

p 2(x) 

= P [3u £ U with rank(x, u) = 1, u 'f x, | Au| = | Ax|] 

= P [Ax = 0] 

= 0 

where we used M Prop. 1] together with dim B ({x}) = 0 in 
the last step. It remains to show that Pl(x) = 0. To this end, 
we first present an auxiliary lemma. 

Lemma 2. Let r > 0, 0 f S C B n (0, L), p > 0, x £ 
B n (0,L), and A £ R mxn with independent rows that are 
uniformly distributed on B n (0,r). Then, there exist s i(p) £ S, 
l = 1 ,...,Ns{p) with Ns(p) being the covering number of S, 
such that 

P[3u £ S with |||Au| — |Ax||| < p\ 

Ns (ft) 

< ^2 p [|l aT s;(p)| 2 - |a T x| 2 | < 2ir(2r + l)p] m (8) 
1=1 


( 3 ) 


where a is uniformly distributed on B n (0,r). 

Proof. Let 5 C Uie{i,..,tv s (p)} B n(Mp), p), Mp) € R n , 

be a minimal covering of S according to the definition 
of the covering number, cf. Definition [3] Then, there exist 
s i{p) G S n B n {vi{p),p) for all l = 1 ,...,N(p). Hence, the 
balls B n (si(p), 2p) cover the set S and have centers in S. We 
can upper bound the lhs in ([Sj by 

P[3u G S with || |Au| — | Ax111 < p] 

Ns(p ) 

< ^2 P[3u G S n B n (si(p), 2 p) with || |Au| — |Ax| || < p] 

i=i 

Ns(p) 

< ^2 P[3u G S n B n (si(p), 2 p) with | |a T u| — |a T x| | < p\ m 

i=i 

(9) 

where © follows from the fact that the rows of A are 
independent and uniformly distributed on B n { 0, r). Using the 
triangle inequality we obtain 

||a T sz(p)| - |a T x|| < ||a T x| - |a T u|| + ||a T u| - |a T s z (p)||. 

(10) 

The second term on the rhs of (flOt can be further upper 
bounded by 

| |a T u| - |a T s z (p)11 < |a T (u - Si(p))\ 

< ||a||||u-s;(p)|| 

<2 rp (11) 

where (UTb follows from u G B n (si(p), 2 p) and a G B n ( 0, r). 
Combining (U0> and m gives 

| |a T x| - |a T u| | > | |a T s z (p)| - |a T x|| - 2 rp. (12) 

Using (fl2l > in © yields 

P[3u G S with |||Au| — |Ax||| < p] 

Ns(p ) 

< p [|l aTs i(p)l - l al " x l| < ( 2r + i)/ 3 ]™ 

1=1 

Ns(p) 

< ^22 P[||a T s z (p)| 2 - |a T x| 2 | < 2Lr(2r + l)p]"' (13) 

1=1 

where (IT3l > follows from | |a T s z (p)| 2 — |a T x| 2 1 = I(|a T s;(p)| + 
|a T x|)(|a T sz(p)| - |a T x|)| < 2Lr||a T s z (p)| - |a'x||. □ 

We now continue with the proof of Pi(x) = 0. Since U is 
a bounded set, there exists an L G R such that 

||u|| <L, uGW. (14) 

We define the sets T 3 (x) by 


it is sufficient to prove that 

P 1 °' ) (x) = P[3u G 7}(x) with |Au| = |Ax|] = 0 


for all j G N. Suppose, by contradiction, that there exists a 
j G N such that P < f ' 1 (x) > 0. Then, 


lim inf 

p—¥ 0 


log P[^ (x) 

log^ 


= 0. 


(15) 


Furthermore, 7)(x) f 0 and by jT5] Sec. 3.2, Property (ii)] 
(recall that 7j(x) C Wi(x) C U ) we get 

dim P (Tj(x)) < in. (16) 


We have 


l iminf lorf(x) 
p -*-0 log i 


= lim inf 
p—>• o 

< lim inf 
p—>• o 


log P [3u G Tj(x) with | Au| = |Ax|] 


log (ES <X)(P)p | l aT s z 0) (p, x)| 2 — | aTx | 2 | < p ) 


log- 


log (p m ES (X) ^ /(AUS z 0) (p,x),x) ) 


< lim inf 
p—> o 


lo g^ 


< liminf log (p m NT j(x) (p)f(p,r,L,j) m ) 

p^O log i 


= dim p (Tj (x)) — to + to lim 


. log f(p,r,L,j) 


p-10 log i 


= dim P (Tj (x)) — to 
< 0 


(17) 

(18) 
(19) 


( 20 ) 


where in (fl7T> we applied Lemma |2] with S = 7)(x) and set 
p = 2Lr(2r + 1 )p, (IT8l > follows from Lemma [3] below with 
u = sf 1 (p. x), v = x, and 5 = p where / is defined in (l22l> . 
in © we used that 


f(p,r, s[ j) (p,x),x) 
< f{p,r,L,j) 

2 j 


2(2r) n ~ 2 r 


V(n,r) 


(l + log ( 


2 + 


8 r 2 L 2 \\ 

p ))’ 


l — I;---) N Tj(x)(p) 


which follows from dTTb and the fact that s ^\p,x) G 7) (x), 
l = 1 Nj- f^(p), and in (l20l > we applied (IT51) . But (ljUl) is 
a contradiction to © . Therefore, p[^ (x) = 0 for all j G N, 
which implies in turn that P\ (x) = 0 and concludes the proof 
of ©. 


75 ( x ) 

Since 


ueWi(x) ^||u|| 2 ||x|| 2 - |u T x| 2 > 


JGN. 


A(x) < ^P[3u G 7)(x) with |Au| = |Ax|] 

few 


V. Concentration of measure result 
Lemma 3. Let r > 0, a be uniformly distributed on B n (0, r), 
C = uu T — vv T with linearly independent vectors u, v G R n , 
and S > 0. Then 

P[|a T Ca| < <5] < Sf(S,r, u,v) 


( 21 ) 















with 


f(S,r, u,v) = 


2 ( 2 r) n - 2 (l + log ( 2 +^ 


u+v u-v - Hr- v 


0 ) 


\/ll u ll 2 ll v ll 2 - |u T v| 2 y (n, r) 


( 22 ) 


Proof. We have 
P[|a T Ca| < 5] 


< 


V(n, 

r) 


B, 

1 


V(n, 

r) 


B, 

1 


V(n, 

r) 


B, 

(2 f) r 

i-2 

V (n. 

■ r) 


X 


|aSR n | |a T Ca|<5| 


da 


X 


{aGR n ||a T WRJR T W T a|<i} 


da 


X 


|beR n ||c T RJRTc|<i} 


X 


B 2 (0,r) 


|cGR 2 ||c t RJR t c|<«5} 


db 


dc 


(23) 

(24) 

(25) 


where (l23l > follows from Lemma [4] with R and J defined in 
(l32l > and W defined in (l33l > and (l24l > follows from changing 
variables to a = Wb with W = (W, Z) £ K" xn where Z £ 
K n x("“ 2 ) is chosen in such a way that WW T = I and c = 
(ci, C 2 ) t with ci = 61 and C 2 = 62 . 

The bound (l36l > on the determinant of the matrix RJR T 
implies that one eigenvalue of RJR T , say Ai, is positive and 
the other eigenvalue of RJR T , say —A 2 , is negative. We can 
assume without loss of generality that Ai > A 2 . Using the 
eigendecomposition RJR T = U diag(Ai, — A 2 )U T , where U £ 
M 2x2 with UU T = I, and changing variables to c = Ud, we 
can further upper bound (OH by 


( 2 rr 


V(n,r) 


b 2 ( 0,1 


C {cGR 2 ||c t RJR t c|<(5} 


dc 



Fig. 1. Intersection of the rectangle {t | t 2 < Air 2 ,t| < A 2 r 2 } with the 
two hyperbolas {t | f 2 — = ±5} for 5 = 1, Ai = 16/r 2 . and A 2 = 4/r 2 . 


{t | t\ — = ±(5} (see Figure [T|i. The bound (OH can then 

be established by performing the following to steps: 

1 ) deriving an upper bound on the integral in (OH . 

2) finding an expression of the eigenvalues of RJR T in terms 
of the vectors u and v, 

which will be done next. We have 


(2 r) 


n —2 


y/AlA 2 V{n,r) J X {tSR 2 |t 2 <A 1 r 2 ,i 2 <A 2 r 2 } 


R 2 


x X 


|t£R 2 11— 1 || <5 J 


dt 


< 


(2r) 


n —2 


v/Al MV(n,r ) J X {t^ 2 \tl+t 2 2 <s+2\ 2 r 2 } 


X x {teR 2 ||t 2 -i 2 |<< 5 } dt 


(28) 


/ 


(2 r ) n ~ 2 

V(n, r) J X {dGR 2 ||A 1 d 2 -A 2 d 2 |<5} 

8 2 (0,r) 

( 2 r ) n_2 f 

WM J X {teR 2 |^- + ^<r 2 ) 

R2 f J 


dd 


( 2 r ) n_2 f 

VA 1 A 2 V(n,r) J '^{zeR 2 |zJ+z^<(5+2A 2 r 2 | 

R 2 

X X {zGR 2 ||z 1 z 2 |<|} dZ 


(29) 


X ^tGR 2 lh 2 f 2 l<H dt (26) < (2r)W 2 f v, , 

l 1,1 a| / VMMV(n,r) J A {z6R 2 |z 2 <5+2A 2 r 2 ,z 2 <,5+2A 2 r 2 } 


< 


(2 r ) r 


j X {t£R 2 i 2 <Air 2 ,t 2 <A 2 r 2 } 

X '^-|zGR 2 | 

R 2 



dz 


X x {teR 2 ||t 2 -t 2 |<5} dt (27 - ) 

where in (f26t we changed variables to t = diag(v / A7, V^ 2 )d- 
The integral in (07l > measures the area that is inside the 
rectangle {t | t\ < Ai r 2 ,f| < A 2 r 2 } and the two hyperbolas 


4(2r )" -2 
v / A 1 A 2 U(n, r) 



zi<\/i5+2A 2 r 2 | 


X X 


{z£R 2 | ,Z2<min (v^A.r 2 ,^)} 


dz 


































x X {z£R 2 | z 2 < y/5+2 A 2 r 2 } 



where in (l28l > we used that t| < A 2 r 2 and \t\—1\\ < 6 imply 
t\ + f| < <5 + 2A 2 r 2 , and in ( l29l > we applied the orthogonal 
transformation z\ = (1/\/2)(ii + i 2 ), £2 = (l/v/2)(ii — b)- 
Combining (1251) with (l30l > and using the expressions (l36l > and 
<ES> gives C3. □ 


Using the definitions of the vectors a and b in (l34l > and 
(l35l i. we can rewrite 



= WR 


U V 

M 

uv . 

1 RF 


which proves (PH . 

The explicit form of the determinant in (PH follows from 
the fact that 


det(RJR T ) = det(R) det(J) det(R T ) 
= — | det(R)| 2 


u T v 


= |u T v| 2 - u T uv T v 


2 


VI. Properties of certain rank two matrices 

Lemma 4. Let u, v £ K" be linearly independent and C = 
uu T — vv T . Then, 


C = WRJR t W t 


with 


J = 



and 


ll u ll M ^ 

0 Ilv-Tj^ullj 


W = 


(m’m) 


(31) 


(32) 


(33) 


where the orthonormal vectors a/||a|| and b/||b|| are defined 
by 


a = u (34) 

11 ' v 

b = v ^ Ti—rp7 a - (35) 


Moreover, 


det(RJR T ) 

= |u T v| 2 -||u|| 2 ||v|| 2 <0 

(36) 

tr(RJR T ) 

= INI 2 - IMI 2 

(37) 

er 2 (RJR T ) 

= |||u + v||||u-v|| - ||||u|| 2 -||v|| 2 | 

(38) 


where <Ti(RJR t ) > <t 2 (RJR t ) are the singular values of 
RJR t . 

Proof. We can rewrite C = AJA T with A = (u, v). Hence, 
to prove (PH . it is sufficient to show that A = WR. 


< 0 (39) 

where (P9l) follows from the Cauchy-Schwarz inequality fl8l 
Sec. 0.6.3] and u and v being linearly independent. The 
expression for the trace (P71) follows from tr(RJR T ) = tr(C). 
Finally, (PH) follows from 

cr 2 (RJR T ) = i(cri(RJR T ) + cr 2 (RJR T )) 

- i(ai(RJR T ) - cr 2 (RJR t )) 

= i^tr(RJR T ) 2 - 4det(RJR T ) - i|tr(RJR T )| 

= I|| u + v||||u-v||-i|||u|| 2 -||v|| 2 |. 

□ 
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